Negativity and contextuality are equivalent notions of nonclassicality 
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Two notions of nonclassicality that have been investigated intensively are: (i) negativity, that 
is, the need to posit negative values when representing quantum states by quasiprobability distri- 
butions such as the Wigner representation, and (ii) contextuality, that is, the impossibility of a 
noncontextual hidden variable model of quantum theory (also known as the Bell-Kochen-Specker 
theorem). Although both of these notions were meant to characterize the conditions under which 
a classical explanation cannot be provided, we demonstrate that they prove inadequate to the task 
and we argue for a particular way of generalizing and revising them. With the refined version of 
each in hand, it becomes apparent that they are in fact one and the same. We also demonstrate 
the impossibility of noncontextuality or nonnegativity in quantum theory with a novel proof that is 
symmetric in its treatment of measurements and preparations. 



It is common to assert that the discovery of quantum 
theory overthrew our classical conception of nature. But 
what, precisely, was overthrown? Being specific about 
the way in which a quantum universe differs from a clas- 
sical universe is a notoriously difficult task and continues 
to be a subject of ongoing research today. This problem 
has become one of practical concern in quantum informa- 
tion theory [l| and quantum metrology [2] as insights into 
the differences between the two theories help to identify 
and analyze information-processing tasks for which quan- 
tum protocols outperform their classical counterparts. 
The two notions of nonclassicality with which we shall 
be concerned in this article are negativity and contextu- 
ality, or more precisely, the presence of negative values in 
quasi-probability representations of quantum theory [3[ 
and the impossibility of noncontextual hidden variable 
models Q. We argue that these notions, construed in 
the traditional manner, are not sufficiently general and 
we promote a particular way of generalizing and revising 
them. In particular, we argue that nonnegativity in the 
distributions representing quantum states is not sufficient 
for classicality; the conditional probabilities representing 
measurements must also be nonnegative. Furthermore, 
we argue that a classical explanation cannot be ruled 
out by considering a single quasiprobability representa- 
tion, such as the Wigner representation 5]; negativity 
must be demonstrated to hold for all such representa- 
tions. Following previous work by the author [6| , we also 
argue that an assumption of determinism that is part of 
the traditional notion of noncontextuality should be ex- 
cised and that context-independence should be required 
not just for measurement procedures but for preparation 
procedures as well. Under these refinements, the two no- 
tions of nonclassicality are revealed to be equivalent. 

Negativity. In 1932, Wigner showed that one can 
represent a quantum state by a function on phase space, 
now known as the Wigner function, having the property 
that the marginals over all quadratures (linear combina- 
tions of position and momentum) reproduce the statistics 
for the associated quantum observables [|| • This function 
cannot, however, be interpreted as a probability distribu- 
tion over a classical phase space because for some quan- 



tum states it is not everywhere nonnegative. We shall 
say that such quantum states exhibit negativity in their 
Wigner representation. It is commonly thought that such 
negativity is a good notion of nonclassicality. However, 
we argue that it is neither a necessary nor a sufficient 
condition for the failure of a classical explanation. 

First, we show that it is not a necessary condition. It 
is well-known that the original Einstein-Podolsky-Rosen 
two-particle state has a positive Wigner representation 
Q so that it can be associated with a classical proba- 
bility density over the phase space of the two particles 
(i.e. over local hidden variables). However, it has also 
been shown that it is possible to violate a Bell inequality 
with such a state Q. How can this be? The resolution 
of the puzzle is that one can only have a classical inter- 
pretation of an experiment if both the preparations and 
the measurements admit a classical interpretation, and in 
the experiments in question, the measurements that one 
requires - such as parity measurements - do not admit 
such an interpretation because the Wigner representa- 
tions of the projectors have values outside of the interval 
[0, 1] and consequently cannot be interpreted as condi- 
tional probabilities. (A similar argument has been made 
in Ref. @.) 

Neither is the negativity of the Wigner representation 
sufficient for nonclassicality. For example, if one consid- 
ers a limited set of preparations and measurements for 
which the associated density operators and positive op- 
erator valued measures (POVMs) are diagonal in some 
orthogonal basis, then the diagonal components may be 
interpreted as classical probabilities, yielding a classical 
explanation of the experimental statistics. Nonetheless, 
if the diagonalizing basis does not consist of quadrature 
eigenstates - for instance, if it consists of number eigen- 
states - then the Wigner representations of these prepa- 
rations and measurements will not be positive. More gen- 
erally, negativity of the Wigner representation does not 
demonstrate that there isn't some other representation 
with respect to which one achieves a classical explana- 
tion. Note that what is classical about these explanations 
is their use of probability theory. We allow the space of 
physical states over which the probabilities are defined 
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to be arbitrary. 

Generalizing the notion of negativity. The les- 
son of the above examples is that in evaluating the pos- 
sibility of a classical explanation of an experiment, one 
must consider the negativity of not just the representa- 
tion of preparations but of measurements as well, and one 
must look at representations other than that of Wigner. 

There is a natural class of representations that in- 
cludes the Wigner representation and that allows one to 
preserve a notion of nonclassicality as negativity. We 
call these "quasiprobability representations" and define 
them by the following features. Every density operator 
p, a positive trace-class operator on a Hilbert space H, 
is represented by a normalized and real-valued function 
fi p on a measurable space A. That is, p <-> p P (X) where 
p p : A — > R and / p p (X)dX = 1. Similarly, every POVM 
{Ek}, a set of positive operators on H that sum to iden- 
tity, is represented by a set {S,E k } of real- valued func- 
tions on A that sum to the unit function on A. That is, 
{E k } «-> {£ Ek (A)} where : A - R and £ fe ^ Ek (A) = 1 
for all A e A. (The trivial POVM {7} is represented by 
£/(A) = 1, and the zero operator is represented by the 
zero function) Finally, the representation must be such 
that 

Tr ( P E k ) = J dA/*„(A)U(A). (f) 

There are infinitely many such representations one could 
define, but popular alternatives to Wigner include the Q 
and P representations of quantum optics. 

We define a nonnegative quasiprobability representa- 
tion of quantum theory as one for which 

MA) > 0, £e(A) > (2) 

for all density operators p and all positive operators E 
less than identity (i.e. all possible POVM elements). 
This would constitute a classical representation of all pos- 
sible preparations and measurements. 

Contextuality. The traditional notion of a noncon- 
textual hidden variable model of quantum theory can be 
expressed as follows Q . Denoting a complete set of vari- 
ables in the model (what Bell refers to as the "beables" 
of the model [l(|) by A, and the measurable space of 
these by A, one represents every pure quantum state \ip) 
by a normalized probability density on A, p^(X), and ev- 
ery projector- valued measure {life} (the spectral elements 
of a Hermitian operator) by a set {£n fc (A)} of {0,1}- 
valued indicator functions on A. An indicator function 
?n fc (A) specifies the probability of outcome k given A. 
Because some outcome must occur, indicator functions 
associated with a complete set of outcomes sum to 1, i.e. 
Sfc£n fc (A) = 1 for all A. A {0,l}-valued indicator func- 
tion is one for which £n fc (A) £ {0, 1}, so that the outcome 
of the measurement is determined by A (rather than being 
probabilistic). We refer to this restriction on indicator 
functions as the assumption of outcome determinism for 
sharp measurements. Note that this assumption is part 
of the traditional notion of noncontextuality (a point to 



which we shall return). Finally, in order for the hidden 
variable model to reproduce the probability of outcome k 
given ijj, one requires that J dX Pip(X)t;n k (A) = (^Ilfel^). 

To see why such a model is called "noncontextual" , 
note that whenever one of the Ilk has rank two or greater, 
it can be decomposed into a sum of smaller rank pro- 
jectors in many different ways, and each of these corre- 
sponds to a different way of implementing the measure- 
ment - a different context. The representation of the 
measurement in the hidden variable model is presumed 
to depend only on the Ilk, and not on how the measure- 
ment was implemented. The representation is therefore 
independent of the context, hence noncontextual. The 
Bell-Kochen-Specker theorem establishes that such a rep- 
resentation of quantum theory is impossible [4]. 

Generalizing the notion of contextuality. As ar- 
gued in detail in previous work the issue of whether 
a measurement's representation in the model is context- 
dependent or not can and should be separated from the 
issue of whether the outcome of the measurement is de- 
termined uniquely or only probabilistically by A. Thus, 
whereas traditionally the question of interest is whether 
or not the measurement outcome for a given A depends 
on the context of the measurement, we claim that the 
interesting question is whether the probabilities of differ- 
ent outcomes for a given A depend on the context. This 
is analogous to Bell's generalization of the notion of lo- 
cality from measurement outcomes being causally inde- 
pendent of parameter settings at space-like separation to 
the probabilities of measurement outcomes being so [To| . 
Mathematically, the proposed generalization corresponds 
to dropping the assumption of outcome determinism for 
sharp measurements from the definition of noncontextu- 
ality (by not requiring that £n fc (A) be {0, l}-valued). 

The fully general definition of a noncontextual hid- 
den variable model requires that all procedures - prepa- 
rations, transformations, and both projective and non- 
projective measurements - are represented in a manner 
that depends only on how the procedure is represented in 
the quantum formalism. If two procedures differ in ways 
that are not reflected in the quantum formalism we call 
this difference part of the context of the procedure. 

To be specific, the assumption of preparation noncon- 
textuality is that the probability distribution pp (A) asso- 
ciated with a preparation procedure P depends only on 
the density operator p associated with P, i.e. pp{X) — 
p p {X). For instance, if the preparation is a mixture 
of pure states \?pk} with weights Wk, then the distribu- 
tion depends only on the average density operator p = 
Wk\?pk)(ipk\ and not the particular ensemble. Sim- 
ilarly, the assumption of measurement noncontextuality 
is that the indicator function £m,/c(A) representing out- 
come k of a measurement procedure M depends only on 
the associated POVM element Ek, i.e., ^M,fc(A) = £,E k (X). 

In order to highlight the content of the assumption of 
noncontextuality, it is useful to formalize the assumptions 
that define a hidden variable model of quantum theory. 
In fact, the set of models that we characterize includes 
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the case wherein the quantum state is a complete descrip- 
tion of reality and so it is better to refer to these simply 
as "ontological models". Every preparation procedure 
P that is permitted by the theory is represented by a 
normalized and positive function on a measurable space 
A, and every measurement procedure M is represented 
by a set of positive functions on A that sum to unity. 
Specifically, we have P <-> fip(X), where /ip : A — > K 
such that J A /jLp(\)d\ — 1, and M <-> {^Af,fc(A)} 7 where 
£,M,k ■ A -> M such that ^fe^M.fcPO = 1 for all A e A, 
and 

A*p(A)>0, ^ M ,fc(A)>0. (3) 

Finally, let V p denote all preparation procedures consis- 
tent with the density operator p, and M{E k } ai l mea- 
surement procedures consistent with the POVM {Ek}. 
An ontological model of quantum theory is such that for 
all P e V p , and for all M 6 M {Ek }, 

J dXp P (X)^ Lk (X) = Tr {pEk) ■ (4) 

A noncontextual ontological model of quantum theory 
(in our generalized sense) is an ontological model that 
satisfies 

jUp(A) = n P (X) for all PeP p (5) 
WA) = Ze, (A) for all M eM {Ek} . (6) 

Eqs. ([5]) and ([6]) codify the assumptions of noncontextu- 
ality for preparations and measurements respectively. 

Whatever reasons one can provide in favor of the as- 
sumption of measurement noncontextuality (for instance, 
that it is the simplest possible explanation of the context- 
independence of the right-hand side of Eq. (fj|), the very 
same reasons can be given in favor of the assumption of 
preparation noncontextuality. Thus if one takes noncon- 
textuality for measurements as a condition for classical- 
ity, then noncontextuality for preparations should also be 
required. 

Equivalence of the two notions of nonclassical- 
ity. By substituting the conditions for preparation and 
measurement noncontextuality, Eqs. (JSJ) and ([5]), into the 
conditions for an ontological model, Eqs. © and ([3]), we 
obtain the conditions for a nonnegative quasi-probability 
representation of quantum theory, Eqs. §T§ and 0. So 
we see that by these definitions, a noncontextual onto- 
logical model of quantum theory exists if and only if a 
nonnegative quasi-probability representation of quantum 
theory exists. 

What we have discovered by this analysis is that 
the assumption of noncontextuality (in our generalized 
sense) has always been implicit in the notion of a quasi- 
probability representation. Given its conceptual signifi- 
cance and mathematical simplicity, it is surprising that 
this connection has not been noted previously. Two likely 
reasons for this are: (i) the lack of emphasis on the rep- 
resentation of measurements in discussions of negativ- 
ity, and (ii) the lack of a generalization of contextual- 
ity to preparations and nonprojective measurements and 



the failure to distinguish the assumption of measurement 
noncontextuality from that of outcome determinism. 

No-go theorems for nonnegativity or noncon- 
textuality. An ontological model of quantum theory 
that is noncontextual, in the generalized sense described 
here, is impossible [6]. It follows that a nonnegative 
quasiprobability representation of quantum theory is also 
impossible. This fact is unlikely to surprise those who 
know quantum theory well. Nonetheless, to our knowl- 
edge, it has not been demonstrated previously (although 
Montina [ill came close to doing so, as we discuss in the 
conclusions). 

An unfortunate feature of existing no-go theorems for 
noncontextual models is that they do not proceed directly 
from the assumption of generalized noncontextuality to 
a contradiction. For instance, in Ref. &], it is shown that 
one can base such a proof on the Bell-Kochen-Specker 
theorem; however, the contradiction is derived not only 
from the assumption of noncontextuality for sharp mea- 
surements, but also the assumption of outcome determin- 
ism for sharp measurements, and the latter assumption is 
in turn derived from noncontextuality for preparations. 
So, despite the standard impression that these no-go the- 
orems concern only the representation of measurements, 
we see that the representation of preparations enters the 
analysis in an indirect way. Similarly, in no-go theorems 
that appeal only to the assumption of noncontextuality 
for preparation procedures (0, Sec. IV), one still relies 
on the fact that only preparations associated with prob- 
ability distributions that are nonoverlapping can be dis- 
criminated by a single-shot measurement. Thus the rep- 
resentation of measurements has appeared, in an indirect 
way, within a proof based primarily on the representa- 
tion of preparations. A proof that is even-handed in its 
treatment of preparations and measurements would be 
preferable and we now provide one. 

An even-handed no-go theorem for nonnegativ- 
ity or noncontextuality. Suppose that one has a set 
of preparation procedures associated with density oper- 
ators pj. A procedure associated with the mixture p = 
w jPj can be implemented as follows. Sample an inte- 
ger j from the probability distribution wj (by rolling a die 
for instance), and implement the preparation procedure 
associated with pj . If the distributions over A that rep- 
resent each of these procedures in the ontological model 
are denoted by p, Pj (A) , then clearly the distribution that 
represents the mixture p is p p (X) — J^. u>jp Pj (A). Thus 
in a noncontextual ontological model, 

if P = w iPh then M A ) = X! w ^Pi ( A )' ( 7 ) 
j j 

A similar argument concerning a mixture of measure- 
ments, each of which has a distinguished outcome asso- 
ciated with a positive operator Ej, establishes that in a 
noncontextual ontological model, 

if E = w 3 Ej then £ B (A) = ^ w£ Ei (A). (8) 

j 3 
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A real-valued function / on the space C(H) of linear 
operators on TL is convex-linear on a convex set S C C(TL) 
if f(J2k w kRk) = J2k w kf(Rk) for R k G S and Wk a 
probability distribution over k. Eqs. © and © assert 
that /i as a function of p is convex-linear on the convex set 
of density operators, and £ as a function of E is convex- 
linear on the convex set of positive operators less than 
identity (the set of "effects"). 

The first two steps of the no-go theorem are famil- 
iar as key elements of the gene ralization of Gleason's 
theorem jll] to POVMs (HQS and analo gous reason- 
ing plays an important role in Hardy's axiomatization of 
quantum theory [TH ]. The first step is to note that a 
function / that is convex-linear on a convex set S of op- 
erators that span the space of Hermitian operators (and 
that takes value zero on the zero operator if the latter is 
in S) can be uniquely extended to a linear function on 
this space. Specifically, if A is a Hermitian operator that 
can be decomposed as A = *^2 k a,kRk, where Rk G S, 
then the extension is f(A) = J2k a kf(Rk) [Hj]- 

The second step is to note that by Reisz's representa- 
tion theorem, the linear function f(A) can be written as 
the Hilbert-Schmidt inner product of A with some fixed 
Hermitian operator, say B\ so that f(A) = Tr(AB). 

It follows that 

M„(A) = Tr[pF(A)], £e(A) = Ti[a(\)E], (9) 

where F and a are functions from A to the Hermitian 
operators on TL. 

A noncontextual ontological model, or cquivalently, a 
nonnegative quasiprobability representation, is one for 
which /i p (A) > and £e(A) > for all p and E, which 
implies that the operators P(A) and cr(A) are not merely 
Hermitian but positive as well. Given that J p p (X)dX = 
1, it follows that f P(A)dA = /, and so we can conclude 
that F(X)dX is a POVM. Furthermore, given that £/(A) = 
1 for all A, we also have Tr[cr(A)] = 1 and so we can 
conclude that a is a map from A to density operators. 

We now show, using a proof by contradiction, that 
p p (X) and Cs(A) of this sort cannot reproduce the quan- 
tum predictions. To do so, p p {X) and £e(X) would 
need to satisfy Eq. ([T]), which implies, via Eq. ©, that 
/ <2ATr[pP(A)]Tr[er(A)P] = Tr(pE) for all p and E. There 
are many ways of deriving a contradiction from here. 
Given that the set of density operators spans the space 
CiTL), we can infer that E = J cZA£e(A)F(A), i.e. every 
E is a positive combination of the F (A). Now consider a 
POVM {Ek} with rank-1 elements. Any given element 
Ek is a positive combination of the F (A), specifically, 
Ek = J dX^E k (X)F(X). However, a rank-1 positive oper- 



ator admits only trivial positive decompositions into pos- 
itive operators (namely, into ones that are proportional 
to itself). It follows that F(X) oc Ek for all A in the sup- 
port of ^E k ■ Recalling that for every A G A, there exists a 
k such that A is in the support of £eh j it follows that for 
every A G A, there exists a k such that P(A) oc Ek. Re- 
peating the argument for another POVM with rank-1 ele- 
ments, say {E'j}, we conclude that for every A G A, there 
exists a j such that F(X) oc E'y However, given that no 
element of {Ek} needs to be proportional to any element 
of {E'j} (for instance, they may be the projector- valued 
measures corresponding to two bases having no elements 
in common), we arrive at a contradiction. [Noting from 
Eqs. (P) and © that {F(X)} and {cr(A)} are dual frames 
in the operator space, this result implies that the dual of 
a frame of positive operators cannot also be a frame of 
positive operators. A direct proof of this fact is possible 
fig and provides a faster route to the contradiction.] 

A similar argument to the one just provided can be 
found in the recent work of Montina [111 ], where it is 
demonstrated that to avoid negative probabilities in on- 
tological models, the representation of pure states can- 
not depend bilinearly on the wavefunction. Although the 
representation of mixed quantum states is not discussed, 
it is a short step from this result to a demonstration of 
a failure of preparation noncontextuality. The reason is 
as follows. As argued earlier, if p Pi {X) is the representa- 
tive of the procedure P, associated with the pure state 
Pi, then the procedure that implements Pi with prob- 
ability Wi is represented by Yli w if J, PiW- However, if, 
as Montina's work entails, p Pi (A) is not a linear func- 
tion of pi, then J2iWip Pi (X) cannot depend merely on 
the density operator YN w iPi that is associated with the 
mixed preparation, but must also depend on the context 
of this preparation - in particular, the pure state ensem- 
ble from which the mixed state was formed. Therefore 
these results are seen to provide another demonstration 
of the impossibility of a model that is noncontextual in 
the generalized sense discussed here. 

Attempts to characterize nonclassicality from either 
the perspective of hidden variables or that of quasiprob- 
ability representations drive one to the same conclusion: 
that the only way in which one can salvage the possibility 
of an ontological model is to deny the implicit starting 
point of these representations, the assumption of noncon- 
textuality. 
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